Anomaly Detection in Cybersecurity Datasets via Cooperative Co-evolution-based Feature Selection
نویسندگان
چکیده
Anomaly detection from Big Cybersecurity Datasets is very important; however, this a challenging and computationally expensive task. Feature selection (FS) an approach to remove irrelevant redundant features select subset of features, which can improve the machine learning algorithms’ performance. In fact, FS effective preprocessing step anomaly techniques. This article’s main objective quantify accuracy scalability both supervised unsupervised effort, novel using FS, called Detection Using Selection (ADUFS), has been introduced. Experimental analysis was performed on five different benchmark cybersecurity datasets with without feature performance techniques were investigated. The experimental results indicate that instead original dataset, dataset reduced number yields better in terms true positive rate (TPR) false (FPR) than existing for detection. For example, technique, multilayer perception increased TPR by over 200% decreased FPR about 97% KDD99 dataset. Similarly, local outlier factor more 40% 15% 36% Windows 7 NSL-KDD datasets, respectively. addition, all require less computational time when suitable rather entire datasets. Furthermore, have compared six other state-of-the-art based decision tree (J48).
منابع مشابه
Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets
Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...
متن کاملMulti-task Feature Selection based Anomaly Detection
Network anomaly detection is still a vibrant research area. As the fast growth of network bandwidth and the tremendous traffic on the network, there arises an extremely challengeable question: How to efficiently and accurately detect the anomaly on multiple traffic? In multi-task learning, the traffic consisting of flows at different time periods is considered as a task. Multiple tasks at diffe...
متن کاملfast sffs-based algorithm for feature selection in biomedical datasets
biomedical datasets usually include a large number of features relative to the number of samples. however, some data dimensions may be less relevant or even irrelevant to the output class. selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. to this end, this paper presents a hybrid method of filter and wr...
متن کاملFeature-based anomaly detection
A feature-based approach for detecting anomalies in spectral, spatial, temporal, and other domains is described. When the frequency of occurrence is small relative to the background, anomalies such as man-made objects in natural image backgrounds do not form their own clusters, but are instead assigned the nearest background cluster, becoming an outlier (statistical anomaly) in that cluster. Ou...
متن کاملFeature Selection for SVM-Based Vascular Anomaly Detection
This work explores feature selection to improve the performance in the vascular anomaly detection domain. Starting from a previously defined classification framework based on Support Vector Machines (SVM), we attempt to determine features that improve classification performance and to define guidelines for feature selection. Three different strategies were used in the feature selection stage, w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM transactions on management information systems
سال: 2022
ISSN: ['2158-656X', '2158-6578']
DOI: https://doi.org/10.1145/3495165